Doubly Mixed-Effects Gaussian Process Regression

The doubly mixed-effects Gaussian process (DMGP) is a multi-task Gaussian process (GP) regression model that learns fixed and random effects across both samples and tasks (the decomposition is illustrated in the figure above). Along with an example dataset, this repository includes implementations of:

Doubly mixed-effects Gaussian process (DMGP)
Translated mixed-effects Gaussian process (TMGP)
Mixed-effects Gaussian process (MGP)

This work is introduced in the following paper (PDF included in the repository):

Jun Ho Yoon, Daniel P. Jeong, and Seyoung Kim. Doubly Mixed-Effects Gaussian Process Regression. Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022.

If you find our work useful, please consider citing:

@InProceedings{pmlr-v151-ho-yoon22a,
  title = 	 { Doubly Mixed-Effects Gaussian Process Regression },
  author =       {Ho Yoon, Jun and Jeong, Daniel P. and Kim, Seyoung},
  booktitle = 	 {Proceedings of The 25th International Conference on Artificial Intelligence and Statistics},
  pages = 	 {6893--6908},
  year = 	 {2022},
  editor = 	 {Camps-Valls, Gustau and Ruiz, Francisco J. R. and Valera, Isabel},
  volume = 	 {151},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {28--30 Mar},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v151/ho-yoon22a/ho-yoon22a.pdf},
  url = 	 {https://proceedings.mlr.press/v151/ho-yoon22a.html}
}

Setup

All of the models are implemented with GPflow and TensorFlow and were tested on Linux and Mac OS. The following package versions were used:

TensorFlow Ver.	GPflow Ver.	Python Ver.
2.4.1	2.1.4	3.8.5

Other dependencies:

scikit-learn
numpy
pyyaml
tqdm
matplotlib
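
Before training, it can help to confirm that the installed packages match the versions listed above. The following is a minimal sketch, not part of the repository: the helper name check_versions is hypothetical, and it assumes that the PyPI distribution names are "tensorflow" and "gpflow".

```python
from importlib import metadata

# Versions reported in the table above. The distribution names used as keys
# are an assumption of this sketch.
EXPECTED = {"tensorflow": "2.4.1", "gpflow": "2.1.4"}


def check_versions(expected):
    """Return {package: (expected, installed or None)} for every mismatch."""
    mismatches = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # the package is not installed at all
        if installed != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, installed) in check_versions(EXPECTED).items():
        print(f"{name}: expected {wanted}, found {installed}")
```

An empty result means both packages are installed at the listed versions. Other versions may still work; they are simply untested here.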

To set up the environment that we used, you can take the following steps:

1. Install Conda (Optional)

If you do not already have conda installed on your local machine, please install it by following the official conda installation instructions.

2. Import Conda Environment for DMGP

You can find the exported conda environment .yaml files under ./env, which can be used to replicate the environment that we used to develop our code. To import and create a new conda environment on your local machine, run on the command line:

conda env create -f ./env/dmgp_env_linux.yaml   # if you are using Linux
conda env create -f ./env/dmgp_env_mac.yaml     # if you are using Mac OS

You can also refer to conda's documentation on managing environments for more information.

3. Activate the Conda Environment

After Step 2, you can check whether the environment (named dmgp_env) was successfully created by running:

conda env list

which lists all of the conda environments available on your local machine. If dmgp_env appears in the list, you can activate it by running:

conda activate dmgp_env

Running the Code (Demo)

For demonstration, we include an example simulation dataset with 30 tasks and 100 samples under ./data. We performed a random 80-20 split along the samples to create the train and test data.
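
A random 80-20 split along the sample axis can be sketched as follows. This is an illustration only, not the script used to produce the files under ./data: the function name split_samples, the seed, and the placeholder arrays (including the input dimension of 5) are assumptions.

```python
import numpy as np


def split_samples(X, Y, test_fraction=0.2, seed=0):
    """Randomly split the rows (samples) of X and Y into train and test sets."""
    n_samples = X.shape[0]
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_samples)  # shuffled sample indices
    n_test = int(round(test_fraction * n_samples))
    test_idx, train_idx = order[:n_test], order[n_test:]
    return (X[train_idx], Y[train_idx]), (X[test_idx], Y[test_idx])


# Placeholder arrays with the demo's dimensions: 100 samples and 30 tasks.
# The input dimension (5) is made up for illustration.
X = np.random.default_rng(1).normal(size=(100, 5))
Y = np.random.default_rng(2).normal(size=(100, 30))
(X_train, Y_train), (X_test, Y_test) = split_samples(X, Y)
print(X_train.shape, Y_train.shape, X_test.shape, Y_test.shape)
```

Because the split is over rows, every task column appears in both the train and test sets; only the samples differ.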

1. Specifying Settings for Training (Optional)

For each model, all training settings, such as the learning rate, the number of inducing points, and the dataset path, are specified in the corresponding YAML file under ./dmgp/params. For example, before running ./dmgp/train_dmgp.py, you can edit ./dmgp/params/train_dmgp_params.yaml to change the default settings.
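
If you prefer to change settings programmatically rather than by hand, the pattern below may help. It is a sketch under stated assumptions: the key names (learning_rate, num_inducing, data_path) and their values are invented for illustration and may not match the actual contents of train_dmgp_params.yaml, and override_settings is a hypothetical helper.

```python
import yaml  # provided by the pyyaml dependency

# Hypothetical settings text: these key names and values are placeholders,
# not the repository's real parameter file.
EXAMPLE_YAML = """
learning_rate: 0.01
num_inducing: 50
data_path: ../data
"""


def override_settings(yaml_text, **overrides):
    """Parse YAML settings and return a copy with selected keys replaced."""
    settings = yaml.safe_load(yaml_text) or {}
    unknown = set(overrides) - set(settings)
    if unknown:
        # Refuse to add new keys so that a misspelt name fails loudly.
        raise KeyError(f"unknown setting(s): {sorted(unknown)}")
    return {**settings, **overrides}


updated = override_settings(EXAMPLE_YAML, learning_rate=0.001)
print(yaml.safe_dump(updated, sort_keys=False))
```

To apply this to a real file, read its text with open(...).read(), pass it to override_settings, and write the result back with yaml.safe_dump. Note that a round trip through PyYAML drops any comments in the file.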

2. Training

After specifying the training settings, change into ./dmgp and run the script corresponding to your model of choice on the command line:

DMGP

python3 train_dmgp.py

TMGP

python3 train_tmgp.py

MGP

python3 train_mgp.py

All of the checkpoints and training results will be saved in a new logs folder inside the working directory (./dmgp/logs).
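
To train several models in one go, the three commands above can be wrapped in a small launcher. This is a hedged sketch rather than a script shipped with the repository: build_command and train are hypothetical helpers, and the default workdir assumes the launcher is started from the repository root.

```python
import subprocess
import sys

# Script names as listed in the training step above.
SCRIPTS = {"dmgp": "train_dmgp.py", "tmgp": "train_tmgp.py", "mgp": "train_mgp.py"}


def build_command(model):
    """Return the command list that launches training for the given model."""
    try:
        script = SCRIPTS[model.lower()]
    except KeyError:
        raise ValueError(
            f"unknown model {model!r}; choose from {sorted(SCRIPTS)}"
        ) from None
    # sys.executable reuses the interpreter of the active environment.
    return [sys.executable, script]


def train(model, workdir="./dmgp"):
    """Run the training script from inside workdir; raise on a non-zero exit."""
    subprocess.run(build_command(model), cwd=workdir, check=True)
```

For example, calling train("dmgp") followed by train("mgp") runs the two scripts one after the other and stops at the first failure.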
